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Abstract. In this paper we revisit Safra's determinization constructions for automata on 
infinite words. We show how to construct deterministic automata with fewer states and, 
most importantly, parity acceptance conditions. Determinization is used in numerous ap- 
plications, such as reasoning about tree automata, satisfiability of CTL*, and realizability 
and synthesis of logical specifications. The upper bounds for all these applications are 
reduced by using the smaller deterministic automata produced by our construction. In 
addition, the parity acceptance conditions allows to use more efficient algorithms (when 
compared to handling Rabin or Streett acceptance conditions). 



One of the fundamental questions in the theory of automata is determinism vs. non- 
determinism. Another related question is the question of complementation. That is, given 
some machine in some complexity class can we produce a machine in the same class that 
accepts the complement language? The problems of determinization and complementation 
are strongly related. Indeed, if the machine is deterministic we just have to dualize its 
answer. If the machine is nondeterministic we do not have a simple solution. 

In the theory of finite automata on finite words the relation between nondeterministic 
and deterministic automata is well understood. We know that there exists an efficient pro- 
cedure that gets a nondeterministic automaton with n states and constructs a deterministic 
automaton with 2 n states accepting the same language [RS59]. This construction is tight 
[HMU00] . By dualizing the acceptance condition of the deterministic automaton we get an 
automaton for the complement language, which is again tight [HMU00J. 

In his proof that satisfiability of SIS is decidable, Biichi introduces nondeterministic 
automata on infinite words [Biic62| . Biichi takes a 'normal' finite automaton and runs it 
on infinite words. A run of such an automaton is an infinite sequence of states, instead of a 
finite sequence. The set of states recurring infinitely often is used to define the acceptance 
condition. A run is accepting according to the Biichi condition if the set of recurring states 
intersects the set of accepting states. 

2000 ACM Subject Classification: F.I.I Models of Computation, F.4.3 Formal Languages. 
Key words and phrases: determinization, finite automata, Biichi, Streett, parity. 
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In the case of finite automata on infinite words determinization and complementation 
are much more involved. Given a deterministic Biichi automaton one can easily construct 
a nondeterministic Biichi automaton for the complement language [Kur87| . However, de- 
terministic Biichi automata are not closed under complementation [Lan69] . This forced the 
introduction of more complex acceptance conditions such as Muller, Rabin, Streett, and 
parity. A Rabin acceptance condition is a set of pairs of subsets of the states. A run is 
accepting according to a Rabin condition if there exists a pair (E, F) such that the set of 
recurring states does not intersect E but does intersect F. The Streett condition is the 
dual of Rabin. A run is accepting according to a Streett condition if for every pair (R, G) 
we have that if G intersects the set of recurring states so must R. A parity condition gives 
an integer priority to every state and a run is accepting if the minimal recurring priority 
is even. The number of priorities is the index of the parity condition. Rabin and Streett 
conditions are more general than parity in the following sense. A parity condition of index 
2k can be written as a Rabin (or Streett) condition with k pairs (without modifying the 
structure of the automaton). We can translate a Rabin or Streett condition with k pairs 
to a parity condition of index 2k + 1 using a gadget with k 2 kl states, thus we multiply the 
number of states of the automaton by k 2 k\. All three conditions are strong enough to allow 
determinization [Tho 90] . 

In the case of automata on infinite words determinization and complementation are no 
longer so strongly coupled. Determinization can be used for complementation by dualizing 
the acceptance condition of the deterministic automaton. However, there are complementa- 
tion constructions that are much simpler than determinization. Specifically, Biichi showed 
that the class of languages recognized by nondeterministic Biichi automata is closed under 
complement without determinization [Biic62j. Sistla, Vardi, and Wolper suggested a singly 
exponential complementation construction [SVW85j . however with a quadratic exponent. 
This was followed by a complementation construction by Klarlund [Kla91j and a very el- 
egant complementation via alternating automata by Kupferman and Vardi [KVOlj . The 
latter construction was recently improved to give a complement automaton with at most 
(0.96n) n states [FK V04| . which is currently the best complementation construction. See 



McNaughton showed a determinization construction that is doubly exponential and results 
in an automaton with the Muller acceptance condition |McN66] . Safra gives a determiniza- 
tion construction which takes a nondeterministic Biichi automaton with n states and returns 
a deterministic Rabin automaton with at most (12) n n 2n states and n pairs |Saf88] . An al- 
ternative determinization with a similar upper bound that also results in a deterministic 
Rabin automaton was given by Muller and Schupp MS95J. Michel showed that this is 
asymptotically optimal and that the best possible upper bound for determinization and 
complementation is n\ AlicSH. L6d98]. 

Safra's idea is to use multiple subsets for one state of the deterministic automaton and 
organize them in the form of a tree. The root of the tree is the classical subset construction 
for automata on finite words. In every transition, a node with set of states S spawns a new 
son that includes all the accepting states in S. Thus, all the states in a leaf are the endpoints 
of runs that agree (more or less) on the number of times they have visited the acceptance 
set. In order to keep the tree finite, we ensure that every state is followed in at most one 

^Incidentally, both determinization constructions provided the best upper bound for complementation at 
the time of their introduction. 



also |Tho90| . 




Determinization constructions for automata on infinite words followed a similar 
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branch of the tree. Whenever a state is followed in more than one branch we keep only 
the copy in the oldest branch. Furthermore, whenever all the states followed by some node 
have visited the acceptance set, the node is marked as accepting and all its descendants are 
removed. The Rabin acceptance condition associates a pair with every node in the tree. 
There should be some node that is erased from the tree at most finitely often and marked 
accepting infinitely often for a run to be accepting. 

The fact that stronger acceptance conditions are introduced raises the question of de- 
terminization of automata using these conditions. Rabin and parity automata can be easily 
converted to Biichi automata. Given a Rabin automaton with n states and k pairs there 
exists an equivalent nondeterministic Biichi automaton with n(k + 1) states. Applying 
Safra's determinization on top of this automaton produces a deterministic Rabin automa- 
ton with (12) n ( fc+1 )(n(/c+l)) 2?1 ( fc+1 ) states and n{k + 1) pairs. For Streett automata, going 
through nondeterministic Biichi automata is far from optimal. A nondeterministic Streett 
automaton with n states and k pairs can be converted to a nondeterministic Biichi au- 
tomaton with n2 k states |Cho74| , which is optimal [SV89 ] . Combining this conversion with 
the determinization results in a doubly exponential deterministic automaton. In order to 
handle Streett automata, Safra generalized his determinization construction [Saf92| . Given 
a Streett automaton with n states and k pairs he constructs a Rabin automaton with 
(n/c)°( nfc ) states and 0(nk) pairs. 

We mentioned that the Rabin and Streett conditions are duals; the dual of the parity 
condition is parity again. Sometimes, given a nondeterministic automaton, we need to 
generate a deterministic automaton for the complementary language, a process called co- 
determinization (e.g., for converting alternating tree automata to nondeterministic tree 
automata). While complementing a deterministic automaton can be easily done by dualizing 
the acceptance condition, such a dualization for a Rabin or Streett automaton results in an 
automaton of the second type. Thus, co-determinization of a Biichi (or Streett) automaton 
results in a deterministic Streett automaton. Translating from Streett to Rabin or parity is 
exponential, we add a gadget with k 2 k\ states where k is the number of pairs of the Streett 
condition [Saf92]. Thus, we multiply the number of states of the automaton by k 2 k\. The 
translation of Rabin to Streett or parity is dual and has exactly the same complexity. 

Determinization has many uses other than complementation. For example, Rabin uses 
McNaughton's determinization of Biichi automata to complement nondeterministic Rabin 
tree automata |Rab72 j Fl A node in an infinite tree belongs to infinitely many branches. A 
tree automaton has to choose states that handle all branches in a single run. In many cases, 
we want all branches of the tree to belong to some word language. If we have a deterministic 
automaton for this word language, we run it in all directions simultaneously. This kind of 
reasoning enables conversion of alternating tree automata to nondeterministic tree automata 
and complementation of nondeterministic tree automata (cf. [Ra b72[ lTho9CH [Var98j ). 

Deterministic automata are used also for solving games and synthesizing strategies. In 
the context of games, the opponent may be able to choose between different options. Using 
a deterministic automaton we can follow the game step by step and monitor the goal of the 
game. For example, in order to solve a game in which the goal is an LTL formula, one first 
converts the LTL formula to a deterministic automaton and then solves the resulting Rabin 
game [PR89] (cf. [KV98, dAHMOl]). Using Safra's determinization, reasoning about tree 

2 Rabin uses this complementation in order to prove that satisfiability of S2S is decidable Rab72] , This 
is essentially the same use that Biichi had for the complementation of Biichi automata. In the context of 
tree automata one has to use a more general acceptance condition. 
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automata reduces to reasoning about nondeterministic Rabin tree automata and reasoning 
about general games reduces to reasoning about Rabin games. Some of these applications 
use co-determinization, the deterministic automaton for the complementary language. 

In this paper we revisit Safra's determinization constructions. We show that we can 
further compact the tree structure used by Safra to get a smaller representation of the 
deterministic automata. By using dynamic node names instead of the static names used by 
Safra we can construct directly a deterministic parity automaton. Specifically, starting from 
a nondeterministic Biichi automaton with n states, we end up with a deterministic parity 
automaton with 2n n n\ states and index 2n (instead of Rabin automaton with (12) n n 2n 
states and n pairs). Starting from a Streett automaton with n states and index k, we end 
up with a deterministic parity automaton with 2n n (£;+l) n ( fc+1 - ) (n(A;+l))! states and index 
2n(k + 1) (instead of Rabin automaton with (12) n ( fe+1 )n n (/c+l) n ( fc+1 )(n(/c+l)) n ( fc+1 ) states 
and n(k + 1) pairs). For both constructions, complementation is done by considering the 
same automaton with a dual parity condition. 

Though dividing the number of states by 12 n is not negligible, the main importance 
of our result is in the fact that the resulting automaton is a parity automaton instead 
of Rabin. Solving Rabin games (equivalently, emptiness of nondeterministic Rabin tree 
automata) is NP-complete in the number of pairs [EJ88]. Solution of parity games is in 
NPnco-NP. The current best upper bound for solving Rabin games is mn k+1 k\ where m 
is the number of transitions, n the number of states, and k the number of pairs [PP06J. 
Using our determinization construction instead of reasoning about Rabin conditions we can 
consider parity conditions. The best upper bound for solving parity games is mn k ^ 2 [JurOO] 
(cf. [BSV031 IJPZ06] for other solutions). That is, we save a multiplier of at least kk\. 

The gain by using our determinization is even greater when we consider applications 
that use co-determinization. As Streett is the dual of Rabin it follows that solving Streett 
games is co-NP-complete. Even if we ignore the computational difficulty, the Rabin accep- 
tance condition at least allows using memoryless strategies. That is, when reasoning about 
Rabin games (or Rabin tree automata) the way to resolve nondeterminism relies solely on 
the current location. This is not the case for Streett. In order to solve Streett games we 
require exponential memory [DJW97[ IHor05| . Applications like nondeterminization of alter- 
nating tree automata use co-determinization but require the result to be a Rabin or parity 
automaton. Hence, the resulting deterministic Streett automaton has to be converted to a 
parity automaton. Again, the price tag of this conversion is a blowup of k 2 kl where k is the 
number of pairs. As the complexity of reasoning about parity games is mn fc / 2 , the extra 
multiplier grows to (k 2 kl) k . 

Recently, Kupferman and Vardi showed that they can check the emptiness of an alter- 
nating parity tree automaton without directly using Safra's determinization |KV05j . Their 
construction can be used for many game / tree automata applications that require deter- 
minization. However, Kupferman and Vardi use Safra's determinization to get a bound 
on the size of the minimal model of the alternating tree automaton. Given such a bound, 
they can check emptiness by restricting the search to small models. Our improved con- 
struction implies that the complexity of their algorithm reduces from (12 ) n V n + 2n (n!) 2n 
to (2n n n!) 2n . 
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2. NONDETERMINISTIC AUTOMATA 

Given a finite set S, a word over £ is a finite or infinite sequence of symbols from E. 
We denote by S* the set of finite sequences over £ and by S w the set of infinite sequences 
over S. Given a word = o"oO"i<72 • • • £ S* U £ w , we denote by w[i, j] the word Oi ■ ■ ■ Uj. 

A nondeterministic automaton is A = (T,, S,5, sq,o), where S is a finite alphabet, 
S is a finite set of states, 5 : S x £ — » 2 s is a transition function, so G 5* is an initial 
state, and a is an acceptance condition to be defined below. A run of A on a word 
w = wqWi ■ ■ ■ is an infinite sequence of states sqSi ■ ■ ■ G S w such that sq is the initial 
state and for all j > we have Sj + \ G 5(sj,Wj). For a run r = sq s i " " > let m/(r) = 
{s 6 5 | s = Sj for infinitely many i's} be the set of all states occurring infinitely often 
in the run. We consider four acceptance conditions. A Rabin condition a is a set of pairs 
{(-Si, Fx), ■ ■ ■ , {Ek, Fk)} where for all i we have E± C S and Fi C 5". We call k the index of the 
Rabin condition. A run is accepting according to the Rabin condition a if there exists some 
i such that inf(r) C\ Ei = $ and inf(r) n i*i 7^ 0. That is, the run visits finitely often states 
from E{ and infinitely often states from Ft. The Streett condition is the dual of the Rabin 
condition. Formally, a Streett condition a is also a set of pairs {{Rx,Gx), ■ ■ ■ ,{Rk,Gk)} 
where for all i we have Ri C 5 and Gj C S 1 . We call A; the index of the Streett condition. A 
run is accepting according to the Streett condition a if for every i either inf(r) n Gj = or 
inf(r) r\ Ri ^ %. That is, the run either visits Gj finitely often or visits Ri infinitely often. 
As a convention for pairs in a Rabin condition we use E and F and for pairs in a Streett 
condition we use R and G. A parity condition a is a partition {Fq, . . . ,Fk} of S. We call 
k the index of the parity condition. A run is accepting according to the parity condition a 
if for some even i we have inf(r) n 7^ and for all i' < i we have inf(r) n -FV = 0. A 
Biichi condition a is a subset of S*. A run is accepting according to the Biichi condition a if 
inf(r) fla / i. That is, the run visits infinitely often states from a. A word w is accepted 
by iV if there exists some accepting run of iV over w. The language of iV is the set of words 
accepted by N . Formally, L{N) = {w \ w is accepted by iV}. Two automata are equivalent 
if they accept the same language. 

Given a set of states 5" C S and a letter a G S, we denote by 5(5', a) the set 
Uses' (J )- Similarly, for a word w £ S* we define 6(S',w) in the natural way: 5(S', e) = 
5' and 5(S',wa) = 5(5(S' ,w), a). For two states s and t and w G £*, we say that t is 
reachable from s reading w if i G <5({s},w;). 

An automaton is deterministic if for every state s G S 1 and letter cr G S we have 
|<5(s, cr)| = 1. In that case we write 5 : S x S — > 5. We use deterministic automata to 
complement a word automaton, i.e., construct an automaton that accepts the complement 
language. We can use deterministic automata also as monitors in games (see below). Deter- 
minization for automata on finite words is relatively simple [ RS59 ] . For automata on infinite 
words this is not the case. Deterministic Biichi automata are strictly weaker than nonde- 
terministic Biichi automata. However, for every nondeterministic Biichi automaton there 
exists an equivalent deterministic automaton with one of the stronger acceptance conditions. 
The best determinization constructions take nondeterministic Biichi or Streett automata 
and convert them to deterministic Rabin automata. We describe these two constructions 
below. 

We use acronyms in {A, D}x{R, S, P, B}x {W} to denote automata. The first symbol 
stands for the branching mode of the automaton: iV for nondeterministic and D for deter- 
ministic. The second symbol stands for the acceptance condition of the automaton: R for 
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Rabin, S for Streett, P for parity, and B for Biichi. The last symbol stands for the object 
the automaton is reading, in our case W for words. For example, a DRW is a deterministic 
Rabin word automaton and an NBW is a nondeterministic Biichi word automaton. 

3. Determination of Buchi Automata 

In this section we give a short exposition of Safra's determinization [Saf88] and show 
how to improve it. We replace the constant node names with dynamic names, which allow us 
to simulate the index appearance record construction within the deterministic automaton. 
We get a deterministic automaton with fewer states and in addition a parity automaton 
instead of Rabin. 

3.1. Safra's Construction. Here we describe Safra's determinization construction [Saf88, 
Saf89]. The construction takes an NBW and constructs an equivalent DRW. Safra con- 
structs a tree of subset constructions. Every node in the tree is labeled by the states it 
follows. The labels of siblings are disjoint and the label of a node is a strict superset of the 
union of the labels of its descendants. The sons are ordered according to their age. The 
transition of a tree replaces the label of every node by the set of possible successors. If 
the label now includes some accepting states, we add a new son to the node with all these 
accepting states. Intuitively, the states that label the sons of a node have already visited 
an accepting state. Thus, the states in the label of a node that are not in the labels of 
its descendants are states that still owe a visit to the acceptance set. If a state occurs in 
two sibling nodes (or more), we remove it from the younger sibling and keep it only in the 
older sibling. If the label of a node becomes equal to the union of labels of its descendants 
then we mark this node as accepting and remove all its descendants. If some node remains 
eventually always in the tree and is marked accepting infinitely often, the run is accepting. 
Formally, we have the following. 

Let M = (E,S,S,s ,a) be an NBW with \S\ = n. Let V = [n]. We first define Safra 
trees. 

Definition 3.1. A Safra tree t over S is (N,r,p,tfi,l, E, F) where the components oft are 
as follows. 

• iV C V is a set of nodes. 

• r E N is the root node. 

• p : iV — > N is the parent function defined over N— {r}, defining for every v € N—{r} its 
parent p(v). 

• ip is a partial order defining "older than" on siblings (i.e., children of the same node). 

• I : N — > 2 s is a labeling of the nodes with subsets of S. The label of every node is a proper 
superset of the union of the labels of its sons. The labels of two siblings are disjoint. 

• E, F C V are two disjoint subsets of V. They are used to define the Rabin acceptance 
condition. 

The following claim is proved in [Saf88l ISaf89l Uut971 IKV05] . 

Claim 3.2. The number of Safra trees over S is not more than (12) n n 2n . 

^The index appearance record is the gadget that allows to translate Rabin and Streett conditions to parity 
conditions Saf92], It is a permutation over the pairs in the Rabin / Streett condition with two pointers into 
the permutation. 
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Proof. The number of ordered trees on n nodes is the (n — l)th Catalan number. We 
know that Cat(n) = w! ^+i)i an d Cat(n — 1) < 4 n . We represent the naming of nodes by 
/ : [n] — » [n] that associates the zth node with its name f{i). There are at most ri n such 
functions. The labeling function is I : S — > [n] where = i means that s belongs to the zth 
node and all its ancestors. Finally, we represent E and F by a function a : V — > {0, F, F} 
such that a(i) = means that i ^ E U F, a{i) = E means that i £ E, and a(i) = F means 
that i E F. There are at most 3 n such functions. 

To summarize, the number of trees is at most 4 n • 3 n • n n ■ n n = (12) n n 2n . fj 

We construct the DRW T> equivalent to M . Let V = (S, D, p, do, a'} where the compo- 
nents of T> are as follows. 

• D is the set of Safra trees over S. For a state d £ D we denote by a c£ subscript the 
components of d. For example, A 7 ^ is the set of nodes of d and l d is the labeling of d. 

• do is the tree with a single node 1 labeled by {so} where F is V — {1} and F is the empty 
set. 

• Let a' = {(Ei,Fi), . . . ,{E n ,F n )} be the Rabin acceptance condition where E{ = {d € 
D \ i€ E d } and Fi = {d £ D \ i £ F d }. 

• For every tree d € D and letter u £ S the transition d' = p(d, a) is the result of the 
following transformations on d. We use temporarily the set of names V disjoint from V. 

(1) For every node v with label S' replace 5' by 5(S', a) and set E and F to the empty 
set. 

(2) For every node v with label S' such that 5'fla ^ 0, create a new node v' G V which 
becomes the youngest child of v. Set its label to be 5' n a. 

(3) For every node v with label S' and state s € S' such that s also belongs to the label 
of an older sibling v' of v, remove s from the label of v and all its descendants. 

(4) Remove all nodes with empty labels. 

(5) For every node v whose label is equal to the union of the labels of its children, remove 
all descendants of v. Add v to F. 

(6) Add all unused nodes to E. 

(7) Change the nodes in V' to nodes in V. 

Claim 3.3. The transition is well defined. 

Proof. It is simple to see that the label of every node is a proper superset of the union of 
the labels of its children and that the labels of two siblings are disjoint. We have to show 
that the n nodes in V are sufficient to complete step 7. As the labels of siblings are disjoint 
and the union of labels of children is a proper subset of the label of the parent it follows 
that every node is the minimal (according to the subset order on the labels) to contain (at 
least) some state s & S. It follows that n node names are sufficient. □ 

Theorem 3.4. [Saf88] L(V) = L(N). □ 

For other expositions of this determinization we refer the reader to [Jut97 , L6d98, Rog01|. 



3.2. From NBW to DPW. We now present our construction. Intuitively, we take Safra's 
construction and replace the constant node name with a dynamic one that decreases as nodes 
below it get erased from the tree. Using the new names we can give up the "older than" 
relation. The smaller the name of a node, the older it is. Furthermore, the names give a 
natural parity order on good and bad events. Erasing a node is a bad event (which forces 
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all nodes with greater name to change their name). Finding that the label of some name 
is equal to the union of labels of its descendants is a good event. The key observation is 
that a node can change its name at most a finite number of times without being erased. 
It follows that the names of all nodes that stay eventually in the tree get constant. Thus, 
bad events happen eventually only to nodes that get erased from the tree. Then we can 
monitor good events that happen to the nodes with constant names and insist that they 
happen infinitely often. Formally, we have the following. 

Let TV = (£, S, 5, so, a) be an NBW with \S\ = n. For the sake of the proof we would 
like to treat the nodes as entities. Hence, we distinguish between the set of nodes V = [2n] 
of a tree and their names that may change and range over [re]. All important information 
(tree structure, label) can be associated with the names and in practice the distinction 
between nodes and their names is not needed. 

Definition 3.5. A compact Safra tree t over S is (N, M, l,p, I, e, f) where the components 
of t are as follows. 

• N C V is a set of nodes. 

• M : N — > [re] is the naming function. 

• 1 G N such that M(l) = 1 is the root node. 

• p : N — > N is the parent function. 

• I : N — > 2 s is a labeling of the nodes with subsets of S. The label of every node is a proper 
superset of the union of the labels of its sons. The labels of two siblings are disjoint. 

• e, / G [re + 1] are used to define the parity acceptance condition. The number e is used 
to memorize the minimal node that changed its name and / the minimal node that is 
equivalent to its descendants. 

Notice that we give up the "older than" relation and replace the sets E and F by numbers 
e and /. We require that the naming M is a bijection from N to [|iV|]. That is, the names 
of the nodes in N are consecutive starting from the root, which is named 1. 
The following claim is proved much like the similar proof for Safra trees. 

Claim 3.6. The number of compact Safra trees over S is not more than 2n n n\. 

Proof. Just like Safra trees there are at most n nodes. We use only the names of the 
nodes. The parent of the node has a smaller name. Thus, the parenthood relation can be 
represented by a sequence of (at most) n — 1 pointers, where the ith pointer is pointing 
to a value in 1, . . . , i— 1. It follows that there are at most (re— 1)! such trees. As in Safra 
trees, every node has at least one unique state in S that belongs to it. We add the function 
I : S — > [re] that associates a state with the minimal node (according to the descendant 
order in the tree) to which it belongs. There are re options for e and / each. In order to 
define the acceptance condition (see below) we need to know the value of e in case e < f 
and the value of / in case f < e. Thus, we need 2n possible values. It follows that there 
are at most 2n ■ (re— 1)! • re™ = 2n n n\ different compact Safra trees. □ 

We construct the DPW V equivalent to N. Let V = (S, D, p, do, a'} where the components 
of V are as follows. 

• D is the set of compact Safra trees over S. 

• do is the tree with a single node 1 labeled {so} an d named 1 where e = 2 and / = 1. 

• The parity acceptance condition a'={Fo, . . . , i*2 n _i) is defined as follows. 
- F = {d G D | / = 1 and e > 1} 
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- F 2i+ i = {d G D | e = i + 2 and / > e} 

- = {deD\f = i + 2 and e > /} 

Note that the case e = 1 is not considered above. In this case the label of the root is 
empty. This is a rejecting sink state. 
• For every tree d G D and letter a G £ the transition d' = p(d, a) is the result of the 
following transformations on d. 

(1) For every node v with label S' replace S' by 5(S',a). 

(2) For every node v with label S' such that 5' fl a / 9, create a new son v' ^ iV of t> . 
Set its label to 5" n a. Set its name to the minimal value greater than all used names. 
We may have to use temporarily names in the range [(n+l)..(2n)]. 

(3) For every node v with label S" and state s G S' such that s belongs also to some 
sibling v' of v such that M(v') < M(v), remove s from the label of v and all its 
descendants. 

(4) For every node v whose label is equal to the union of the labels of its children, remove 
all descendants of v. Call such nodes green. Set / to the minimum of and the 
names of green nodes. Notice that no node in [(n+l)..(2n)] can be green. 

(5) Remove all nodes with empty labels. Set e to the minimum of n+1 and the names of 
nodes removed during all stages of the transformation. Notice that the priority of a 
state is even only when f < e. Thus, green nodes that are removed cannot make a 
state of even priority. 

(6) Let Z denote the set of nodes removed during all previous stages of the transformation. 
For every node v let rem(v) be \{v' G Z \ M{v') < M(v)}\. That is, we count how 
many nodes are removed during the transformation and have smaller name than 
the name of v. For every node v such that l(v) ^ we change the name of v to 
M(v) — rem(v). It is simple to see that the resulting names are consecutive again 
and in the range [n]. 

We show that the two automata are equivalent. The proof is an adaptation of Safra's proof 
[Saf88] , 

Theorem 3.7. L(V) = L(M). 

Proof. Consider w E L(Af). We have to show w £ L{T>). Let r = sqS\ ■ ■ ■ be an accepting 
run of N on w. Let r' = dod\ • • • be the run of T> on w and let di = (iVj, Mj, l,pi, h, ej, fi). 
It is simple to see that for all z > we have Sj G hiX) and e$ > 1. If step 4 is applied 
infinitely often to node 1 (equivalently, / = 1 infinitely often, or during the transformation 
of the trees the label of 1 equals the labels of its sons) then r' visits -Fo infinitely often. 

Otherwise, from some point onwards in r 1 we have step 4 is not applied to node 1. Let 
io be this point. There exists a point i\ > io such that G a. It follows that for all 
i > i\ we have S{ belongs to some son v\ of 1. Notice, that just like in Safra's case, the run 
r may start in some son of 1 and move to a son with a smaller name. However, this can 
happen finitely often and hence we treat v\ as constant. The name M{v\) may decrease 
finitely often until it is constant. Let i 2 be such that for all i > i 2 we have a\ = Mj(-ui). As 
Mi(v\) = ai for all i > i 2 it follows that e, > a\ for all i > i 2 - 

Suppose that step 4 is applied to v\ infinitely often (equivalently, / < a\ infinitely 
often). It follows that for every odd a' < 2a\ — 2 we have F a > is visited finitely often and 
either F 2ai - 2 is visited infinitely often or there exists some even a' < 2a± — 2 such that F a > 
is visited infinitely often. In this case T> accepts w. 
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Otherwise, step 4 is applied to v\ finitely often. We construct by induction a sequence 
V\ , . . . , v k such that eventually vi,...,Vk do not change their names and r belongs to all of 
them. As the number of active nodes in a tree (nodes v such that l(v) ^ 0) is bounded by 
n we can repeat the process only finitely often. Hence, w is accepted by T>. 

In the other direction, consider w G L(T>). Let r' = dtydi • • • be the accepting run of T> 
on w where di = (iVj, Mj, l,Pi, Zj, /j, ej). Let Fi a be the minimal set to be visited infinitely 
often. It follows that eventually always ej > a + 1 and infinitely often /j = a + 1. We first 
prove two claims. 

Claim 3.8. For every i G N, v G Ni, and every state s G h{v) we have s is reachable from 
sq reading w[0, i — 1]. 

Proof. We prove the claim simultaneously for all v G iVj by induction on i. Clearly, it holds 
for i = 0. Suppose that it holds for i. As C 5(li(v'),Wi) for some t/ G iVj it follows 

that every state in li + i(v) is reachable from sq reading io[0,i]. □ 

Claim 3.9. Consider i, i! G N such that i < i! , di, d^ G Fi a for some a, and for all a' < 2a 
and for all i < j < i' we have dj £ F a > . Then there exists a node v such that Mj (v) = a + 1 
for all i < j < %' and every state s in li>{v) is reachable from some state in li(v) reading 
w[i,i' — 1] with a run that visits a. 

Proof. There exists some node v such that Mi(v) = a + 1 (as di G i*2 a ). By assumption, 
for every a' < 2a the set F a i is not visited between i and i! . Hence, for every node v' such 
that Mi(v) < a + 1 we have that Mj(v') = Mi(v') for all i < j < %' . That is, between i and 
i' all nodes whose name is at most a + 1 do not change their names. In particular, for all 
i < j < i' we have Mj(v) = a + 1. We show that for every % < j < i' and every descendant 
v' of v, every state in lj{y') is reachable from some state in U{v) along a run visiting a. 
As v is a leaf in di for j = i this is obviously true. Suppose it is true for j and prove for 
j + 1. We know that for every descendant v 1 of v either lj + \{v') C S(lj(v),Wj) D a or for 
some descendant v" of v we have lj+i(v') C S(lj(v"),Wj) (v" may be v 1 ). As during the 
transformation from dy-\ to d# the label li>(v) equals the union of labels of sons of v the 
claim follows. In particular, if i' = i + 1 then all states in li>(v) are in a and we are done. Q 

We construct an infinite tree with finite branching degree. The root of the tree corre- 
sponds to the initial state of jV. Every node in the tree is labeled by some state of N and 
a time stamp i. An edge between the nodes labeled (s,i) and (i, j) corresponds to a run 
starting in s, ending in t, reading w[i,j — 1], and visiting a. From Konig's lemma this tree 
contains an infinite branch. The composition of all the run segments in this infinite branch 
is an infinite accepting run of M on w. 

Let (so, 0) label the root of T. Let io be the maximal location such that for all a' < 2a 
the set F a is not visited after i^. Let v be the node such that for all % > io we have 
Mi{v) = a + 1. Let i\ be the minimal location such that i\ > io and fi x = a + 1 (that is step 
4 was applied to v). For every state s in l^iv) we add a node to T, label it by (s,ii) and 
connect it to the root. We extend the tree by induction. We have a tree with leafs labeled 
by the states in kAv) stamped by time ij, and /j . = a + 1 (step 4 was applied to v). That 
is, for every state s in li 3 {v) there exists a leaf labeled (s,ij). We know that F2 a is visited 
infinitely often. Hence, there exists ij + \ > ij such that fi j+1 = a + 1 (step 4 is applied to 
v). For every state s' in h j+1 (v) we add a node to T and label it (s',ij + i). From Claim [3791 
there exists a state s in h {v) such that s' is reachable from s reading w[ij,ij+x — 1] with a 
run that visits a. We connect (s',ij + i) to (s,ij). 
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From Claim [3781 it follows that every edge (so, 0), (s', i\) corresponds to some run starting 
in so, ending in s', and reading w[0, i± — 1]. From Claim [3T9l every other edge in the tree 
(s,ij), (s',ij + i) corresponds to some run starting in s, ending in s', reading w[ij, ij + \ — 1], 
and visiting a. From Konig's lemma there exists an infinite branch in the tree. This infinite 
branch corresponds to an accepting run of on w. □ 

Theorem 3.10. For every NBW M with n states there exists a DPWD with 2n n n\ states 
and index 2n such that L(T>) = L(N). □ 

We note that this improves Safra's construction in two ways. First, we reduce the number 
of states from (12)™n 2n to 2n n n\. Second, our automaton is a parity automaton which 
is amenable to simpler algorithms. For example, given a DPW it is possible to check 
in polynomial time what is the minimal parity index that enables recognition of the same 
language, and to find such an optimal parity index on the same automaton structure [CM99] . 
On the other hand, finding the minimal Rabin index of a DRW is NP-hard [KPBV95J 
and it may be the case that an optimal condition cannot be found on the same structure 
[KMM04]. Many times we are interested in a deterministic automaton for the complement 
language, a process called co-determinization. The natural complement of a DRW is a DSW. 
However, the Streett acceptance condition is less convenient in many applications (due to the 
fact that Streett acceptance conditions require memory). Thus, the complement automaton 
is usually converted to a DPW using the IAR construction [Saf92|. In such a case, one would 
have to multiply the number of states by k 2 k\ where k is the number of Rabin pairs. A 
similar effect occurs when using deterministic automata in the context of games. Solution 
of Rabin games incurs an additional multiplier of k 2 k\. With our construction this penalty 
is avoided. 

4. DETERMINIZATION OF STREETT AUTOMATA 

In this section we give a short exposition of Safra's determinization of Streett automata 
|Saf92| and show how to improve it. Again, we replace the constant node names with 
dynamic names. We get a deterministic automaton with fewer states and in addition a 
parity automaton instead of Rabin. The intuition is similar to the construction in Section [3j 

4.1. Safra's Construction. Here we describe Safra's determinization for Streett Automata 
[Saf92]j. The construction takes an NSW and constructs an equivalent DRW. 

As mentioned, in the case of Streett automata, determinization via conversion to Biichi 
automata is less than optimal. Safra generalizes his construction to work for Streett au- 
tomata. The idea is still to use a set of subset constructions. Let S = (£, S, 5, so, a) be an 
NSW where a = {{Ri, G\},. . . , {R^, G^}}. We say that a run r of S is accepting according 
to the witness set J C [k] if for every j € J we have inf{r) n R j ^ and for every j (jz J we 
have inf(r) n Gj = 0. It is easy to construct an NBW whose language consists of all words 
accepted according to witness set J. The NBW has two parts. In the first part it waits until 
all visits to Gj for j ^ J have occurred. Then it moves nondeterministically to the second 
part where it waits for visits to Rj for each j £ J according to their order and disallows 
visits to Gj for every j ^ J. If the automaton loops through all j € J infinitely often the 
run is accepting. Unfortunately, the number of possible witness sets is exponential. 

Safra's construction arranges all possible runs of the NSW and all relevant witness sets 
in a tree structure. A state is again a tree of subset constructions. Every node in a tree 
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represents a process that is monitoring some witness set and checking this witness set. The 
node for witness set J follows some set of states. It waits for visits to Rj for every j € J 
(in descending order), if this happens without visiting Gj for j ^ J then the node succeeds 
and starts all over again. 

A Streett Safra tree is a tree whose nodes are labeled by subsets of the states in S. 
The labels of siblings are disjoint and the labels of sons form a partition of the label of the 
parent. In addition every node is annotated by a subset JC [k]. The annotation of a son 
misses at most one element from the annotation of the parent. Every node that is not a leaf 
has at least one son whose annotation is a strict subset. In addition, children are ordered 
according to their age. 

The root node monitors the set [k] as a possible witness set. If some node is annotated 
with J and has a child annotated J — {j} this means that the child has given up on the 
hope that Rj will occur. If a node has given up on Rj but visits Gj then the states visiting 
Gj have no place in this node and they are moved to a new sibling. Similarly, if a node has 
given up on Rj and visits Rj then the states visiting Rj have no place in this node and they 
are moved to a new sibling. Whenever the label of a node gets empty it is removed from 
the tree. If all the states followed by a node completed a cycle through its witness set, all 
the descendants of this node are removed and it is marked accepting. The Rabin condition 
associates a pair with every node. A run is accepting if some node is erased finitely often 
and marked accepting infinitely often. 

Let S = (S, S, 5, so, a) be an NSW where a = {(Ri, G\), . . . , (Rk, Gk)} and \S\ = n. 
Let m = n(k + 1) and V = [m]. We first define Streett Safra trees. 

Definition 4.1. A Streett Safra tree t over S is (N, r,p, ijj, I, h, E, F) where the components 
of t are as follows. 

• iV C V is the set of nodes. 

• r £ N is the root node. 

• p : N — > N is the parent function defined over N—{r}, defining for every v £ N—{r} its 
parent p(v). 

• ip is a partial order defining "older than" on siblings (i.e., children of the same node). 

• I : N — > 2 s is a labeling of nodes with subsets of S. The label of every node is equal to 
the union of the labels of its sons. The labels of two siblings are disjoint. 

• h : N —* 2^ annotates every node with a set of indices from [k]. The root is annotated 
by [k]. The annotation of every node is contained in that of its parent and it misses at 
most one element from the annotation of the parent. Every node that is not a leaf has 
at least one son with strictly smaller annotation. 

• E,F C V are two disjoint subsets of V. They are used to define the Rabin acceptance 
condition. 

The following claim is proved in [Saf92l [SchOl]. 

Claim 4.2. The number of Streett Safra trees over S is at most 

(12) n ( fc+1 V^+l) n(fc+1) (n(/fc+l)r (fe+1) • 

Proof. The number of ordered trees on m nodes is at most 4 m . We represent the naming 
of the nodes by / : [m] — > [m]. There are at most m m such functions. The labeling of a 
node is determined by the labels of the leaves in the subtree below it and labels of leaves 
are disjoint. The labeling function S — * [n] associates a state s with the leaf it belongs to. 
There are at most n leaves and n n such functions. We can represent the annotation h by 
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annotating every edge by the value j G [k] such that j is in the annotation of the parent 
and not in the annotation of the son. If no such j exists then we annotate the edge by 0. 
The edge annotation function is h : [m] — ► [0..k] associating an index to the target node of 
the edge. Finally, E and F are represented by a function a : V — > {®,E, F}. The number 
of trees is at most 4 m • 3 m • m m ■ n n ■ (k + l) m = (12) n ( fc+1 )n n (/c+l) n ( fc+1 ) (n(fc+l)) n(fc+1 ) . □ 

We construct the DRW T> equivalent to S. Let T> = (£, D, p, do, a') where the components 
of T> are as follows. 

• D is the set of Streett Safra trees over S. 

• do is the tree with a single node 1 labeled by {so} where E is V — {1} and F is the empty 
set. 

• Let a' = {(Ei,F\), . . . , (E m ,F m )} be the Rabin acceptance condition where Ei = {d G 
D | i G E d } and F< = {d G D \ i G F d }. 

• For every tree d G D and letter a G £ the transition d! = p(d, a) is the result of the 
following (recursive) transformation applied on d starting from the root. Before we start, 
we set E and F to the empty set and replace the label of every node v by S(l(v),a). We 
use temporarily the set of names V disjoint from V. 

(1) If v is a leaf such that h(v) = stop. 

(2) If v is a leaf such that h(v) ^ 0, add to v a new son v' G V. Set l(v') = l(v) and 
h(v') = h(v) — {max(h(v))}. 

(3) Let vi, . . . , vi be the sons of v (ordered from oldest to youngest) and let ji, . . . ,ji be 
the indices such that ji G h(v) — h(vi) (note that \h(v) — h{vi)\ < 1; in case that 
h(v) = h{vi) we have ji = 0). Call the entire procedure recursively on v\, . . . ,vi (call 
recursively also for sons created in step 2 above). 

For every son and every state s G l(vi) do the following. 

(a) If s G Rj i , remove s from the label of V{ and all its descendants. Add a new 
youngest son v' G V to v. Set l(v') = {s} and h(v') = h{y) — {max({0} U (h(v) D 

(b) If s G Gj t , remove s from the label of Vi and all its descendants. Add a new 
youngest son v' G V to u. Set l(v') = {s} and h(v') = h(v) — {ii}0 

(4) If a state s appears in l(vi) and i(wj') and ji < jV then remove s from the label of 
and all its descendants. 

(5) If a state s appears in l(vi) and l(vi>) and = jV then remove s from the label of the 
younger sibling and all its descendants. 

(6) Remove sons with empty label. 

(7) If all sons are annotated by h(v) remove all the sons and all their descendants. Add 
v to F. 

Finally, we add all unused names to E, remove unused names from F, and change the 
nodes in V' to nodes in V . 

Claim 4.3. The transition is well defined. 

We note that in Safra's original construction (Saf92, SchOl] the rank of the new node is set to h(y') = 
h(v) — {max(h(v))} . In case that both Gj { and Rj i are visited infinitely often this may lead to the following 
situation. Suppose that the node v has a son v that is waiting for a visit to Rj i where ji is not the maximum 
in h(v). In the case that Gj i is visited, the runs are moved to new siblings that await max{h(v)) again. 
This way, the run may cycle infinitely often between max(h(v)) and ji, leading to incompleteness of the 
construction. 
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Proof. It is simple to see that the label of every node is equal to the union of the labels of 
its children and that the labels of two siblings are disjoint. We have to show that the m 
nodes in V are sufficient to change the nodes in V to nodes in V . 

There exists a path from the root to a leaf where no edge is annotated by 0. For every 
edge annotated 0, there is a path from the target of this edge to a leaf where no edge is 
annotated by 0. Hence, there are at most n — 1 edges annotated by 0. Every other edge is 
either annotated by some index i £ [k] or connects a parent v to a son v' such that there 
is some state s £ S such that s E l(v) and s ^ l(v'). Thus, there can be at most nk such 
edges. Totally, n(k + 1) node names are sufficient. □ 

Theorem 4.4. {Saf92] L(V) = L(S). 

For other expositions of this determinization we refer the reader to [Jut97j ISch01| . 

4.2. From NSW to DPW. We now present our construction. Let S = (L,S,6,so,a) 
be an NSW where a = {{Ri, G\), . . . , {R^, Gt}} and \S\ = n. Denote m = n{k + 1). For 
the sake of the proof, we distinguish between the set of nodes V = [2m] of a tree and 
their names that range over [m]. All important information (tree structure, label) can be 
associated with the names and in practice the distinction between nodes and their names 
is not needed. 

Definition 4.5. A compact Streett Safra tree t over S is (N,M,l,p,l,h, e, /) where the 
components of t are as follows. 

• N C V is a set of nodes. 

• M : N — > [m] is the naming function. 

• 1 G N such that M(l) = 1 is the root node. 

• p : N — ► N is the parent function. 

• I : N —> 2 s is a labeling of the nodes with subsets of S. The label of every node is equal 
to the union of the labels of its sons. The labels of two siblings are disjoint. 

annotates every node with a set of indices from [k]. The root is annotated 
by [k]. The annotation of every node is contained in that of its parent and it misses at 
most one element from the annotation of the parent. Every node that is not a leaf has 
at least one son with strictly smaller annotation. 

• e, / € [m + 1] are used to define the parity acceptance condition. 

Notice that we give up the "older than" relation and replace the sets E and F by 
numbers e and /. The naming M is a bijection from N to [\N\]. That is, the names of 
nodes in N are consecutive starting from the root, which is named 1. 

The following claim is proved much like the similar proof for Streett Safra trees. 

Claim 4.6. The number of compact Streett Safra trees over S is not more than 

2n n (k+l) n ( k+1 Xn(k+l))\. 

Proof. Just like Streett Safra trees there are at most m nodes. We use only the names of 
the nodes. The parent of the node has a smaller name. Thus, the parenthood relation can 
be represented by a sequence of (at most) m — 1 pointers, where the ith pointer is pointing 
to a value in 1, . . . , %— 1. It follows that there are at most (m— 1)! such trees. As the labels 
of the leaves form a partition of the set of states S there are at most n leaves. We add 
the function I : S — > [n] that associates a state with the unique leaf to which it belongs. 
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Setting l(s) = i means that s belongs to the ith leaf. We can represent the annotation h 
by annotating every edge by the value j G [k] such that j is in the annotation of the parent 
and not in the annotation of the son. If no such j exists then we annotate the edge by 0. 
The edge annotation is represented by a function h : [m] — > [0, . . . , k]. In order to define 
the acceptance condition (see below) we need to know the value of e in case e < f and the 
value of / in case / < e. Thus, we need 2m possible values. 

It follows that the number of compact Streett Safra tress is at most 2m ■ (m— 1)! • n n ■ 
(jfe + l) m = 2m! • n n ■ (k + l) m = 2n n {k+l)< k+1 \n(k+l))\. □ 

We construct the DPW T> equivalent to S. Let T> = (£, D, p, do, a') where the components 
of V are as follows. 

• D is the set of compact Streett Safra trees over S. 

• do is the tree with a single node 1 labeled {sq}, named 1, and annotated [k]. We set e = 2 
and / = 1. 

• The parity acceptance condition a'=(Fo, . . . , F2 m -i) is defined as follows. 

- F = {d G D | / = 1 and e > 1} 

- F 2i+1 = {d G D | e = i + 2 and / > e} 

- F 2i+2 = {d G D | / = i + 2 and e > /} 

As before, the case where e = 1 is a rejecting sink state. 

• For every tree d G D and letter a G S the transition d' = p(<i, cr) is the result of the 
following (recursive) transformation applied on d starting from the root. Before we start, 
we set e and / to m + 1 and replace the label of every node v by S(l(v),a). 

(1) If v is a leaf such that h(v) = stop. 

(2) If v is a leaf such that h(v) ^ 0, add to v a new son v' . Set l(v') = l(v), h(v') = 
h(v) — {max(h(v))}, and set M(v') to the minimal value greater than all used names. 
We may use temporarily names out of the range [m\. 

(3) Let vi, . . . , vi be the sons of v (ordered according to their names) and let ji, ■ ■ ■ ,ji be 
the indices such that ji = max((h(v) U {0}) — h(vi)) (note that \h(v) — h(vi)\ < 1; 
in case that h(v) = h(vi) we have ji = 0). Call recursively the entire procedure on 
vi,...,vi (including sons created in step 2 above). 

For every son Vi and every state s G l(vi) do the following. 

(a) If s G Rj i , remove s from the label of Vi and all its descendants. Add a new son 
v 1 to v. Set l(v') = {s}, h(v') = h(v) - {max({0} U (h(v) n {1, . . . , ji - 1}))}, and 
set M(v') to the minimal value larger than all used names. 

(b) If s G Gj t , remove s from the label of Vi and all its descendants. Add a new son 
v 1 to v. Set l(v') = {s}, h(v') = h(v) — {ji}, and set M{y') to the minimal value 
larger than all used names. 

(4) If a state s appears in l(vi) and /(iv) and ji < jV then remove s from the label of vy 
and all its descendants. 

(5) If a state s appears in l(vi) and l(vi>), ji = jy, and M(vi) < M(«j/) then remove s 
from the label of and all its descendants. 

(6) Remove sons with empty label. Set e to the minimum of its previous value and the 
minimal name of a removed descendant. 

(7) If all sons are annotated by h(v) remove all sons and all their descendants. Set e to 
the minimum of its previous value and the minimal name of a removed descendant. 
Set / to the minimum of its previous value and the name of v. 
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Let Z denote the set of nodes removed during this recursive procedure. For every node 
v let rem(v) be \{v' E Z \ M(y') < M(v)}\. That is, we count how many nodes got 
removed during the recursive transformation and their name is smaller than the name of 
v. For every node v such that l(v) ^ we change the name of v to M{v) — rem(v). The 
resulting names are consecutive again and in the range [m]. 

We show that the two automata are equivalent. The proof is an adaptation of Safra's proof 
[Saf92] . 

Theorem 4.7. L(V) = L{S). 

Proof. Consider w E L(S). We have to show w E L(D). Let r = sqsi ■ ■ ■ be an accepting 
run of S on w. Let J C [k] be the maximal witness set for r. Let r' = dod\ • • • be the run of 
Ponjo and let di = (N, Mi, l,pi,li,hi,ei, fi). It is simple to see that for all ? > we have 
Si E and e$ > 1. Let i\ be the location such that for all i > i\ we have Si E inf(r). 
That is, all states appearing after i\ appear infinitely often in the run. In particular, for all 
i > i\ we have Sj ^ Gj for all j ^ J. 

If step 7 is applied infinitely often to node 1 (equivalently, / = 1 infinitely often, or 
during the application of transitions the descendants of 1 are all annotated by [k]) then r' 
visits Fq infinitely often. Otherwise, from some point onwards in r' we have step 7 is not 
applied to node 1. Let i<i > i\ be this point. It follows that for all i > 12 node 1 is not a 
leaf. Then for all i > i<i we have Si appears in the label of some son of 1. This son can 
be changed a finite number of times. The annotation of the edge to the son containing r 
can only decrease. If the edge is annotated by some j 6 J then r eventually visits again 
Rj and r is migrated to some son annotated by j' < j. If the edge is annotated by some 
j (ji J then r never visits Gj again and the only way to migrate to a different son is if r 
somehow appears again in a different son with smaller annotation, or if r appears again in 
a different son with smaller name. Obviously, this can happen a finite number of times and 
eventually r stays in the same son of 1. The edge to this son is either annotated by or by 
some ji £ J. Formally, let i% > 12 be such that for all i > 13 we have Si appears in li{v\) 
and is a son of 1. We know that for all i > we have J C hi{v\). The name M{v\) may 
decrease finitely often until it is constant. Let 24 > ^3 be such that for all i > 24 we have 
a\ = Mi{v\). As Miiyx) = a\ for all i > 14 it follows that ej > a\ for all i > 24. 

If step 7 is applied to node v 1 infinitely often then we are done. Otherwise, we construct 
by induction a sequence l,v\, . . . ,v Q such that eventually v\, . . . ,v Q do not change their 
names and r appears in the label of all of them. Furthermore, we have J C h(v Q ) (which 
implies that J C h(v /) for all o € [l-.o]). As the number of active nodes in a tree is bounded 
by m we can repeat the process only finitely often. Hence, w is accepted by T>. 

In the other direction, consider w G L(T>). Let r' = d$d\ • • • be the accepting run of 
T> on w where di = (Ni,Mi,l,pi,li,hi,ei, fi). Let F% a be the minimal set to be visited 
infinitely often. It follows that eventually always > a + 1 and infinitely often fi = a + 1. 

We write in short avoids G- instead of avoids Gj for every j £ J and visits Rj instead 
of visits Rj for every j £ J. We first prove two claims. 

Claim 4.8. For every i G N, v E Ni, and every state s E h(v) we have s is reachable from 
so reading w[0, i — 1]. 

Proof. We prove the claim simultaneously for all v E Ni by induction on i. Clearly, it holds 
for i = 0. Suppose that it holds for i. As li + i(v) C 5(li(v'),Wi) for some v' E N it follows 
that every state in li + i(v) is reachable from sq reading w[0, i]. □ 
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The following claim shows that if some node v is colored green twice, and it is not 
removed and does not change its name between the two greens, then all the runs followed 
by v visit R 7 , where J is the annotation of v. We essentially prove that all states that are 
followed by a son v' of v annotated by j £ J are endpoints for runs that already visited Rj/ 
for all f £ J such that j' > j. When v is colored green the second time, all states followed 
by v are in sons annotated 0. This means that all Rj is visited. 

Claim 4.9. Consider i, i' £ N such that i < i' , di, dy £ F2 a for some a, and for all a' < 2a 
and for every o £ we have d Q ^ F a > . Then there exists a node v such that M (v) = a+1 
for every o £ [i..i'\ and every state s in li>(v) is reachable from some state in U(v) reading 
w[i,i' — 1] with a run that avoids G- and visits R } . 

Proof. As di £ F 2a , there exists some node v such that M^v) = a + 1. By assumption, for 
every a' < 2a the set F a i is not visited between i and i! . Hence, for every node v' such that 
Mi(y') < a + 1 and for every o £ [i..i'] we have that M {v') = Mj(f'). That is, between i 
and i! all nodes whose name is at most a + 1 do not change their names. In particular, for 
every o £ we have M Q (v ) = a + 1. In addition, there exists J C [A;] such that for every 
o £ [i..i'] we have h Q {y) = J. 

We find a run followed by node v between i and i! that avoids G- and visits R } . 

We first show that all runs followed by v avoid G_. One of the invariants maintained 
by the transition is that if a node v is annotated by a set J then it cannot be labeled by 
states in Gj for j ^ J. Formally, suppose that for some o £ [i..i'] there exists s £ Z (v) such 
that s £ for some j ^ J. Let t> ' be the youngest (according to the parenthood relation) 
ancestor of v such that j £ h Q (v') and let v" be the son of v' that is an ancestor of v (it may 
be v itself). It follows that the edge from v' to v" is labeled by j. Then, when applying 
step 3b on the transformation from d Q -\ to d Q the state s would have been moved from v" 
to some other son of v'. 

We show now that for every o £ [i..i'] and every s £ l (v) such that s appears in a son 
of v whose edge is annotated j £ J there exists a run starting in some state in li(v), visiting 
R[(j+i)..k]nJi reading w[i,o — 1], and ending in s. We prove this by induction on o. The first 
thing in the transformation from di to di + \ is to put all the elements in U+i(v) in a son 
labeled by max(J). Clearly, this satisfies our requirement. Suppose that it is true for o and 
prove for o+ 1. Consider a state s appearing in l 0+ \(v) in a son v' such that the edge (v, v') 
is annotated by j. If there is a predecessor of s in the same son in d Q then the claim follows 
(this covers the case where the same state appears in a node with smaller annotation or in 
a node with same annotation but smaller name). Otherwise, s appears in a son created by 
step 3a. It follows that there is some predecessor s' of s in a son v" of v in d Q such that 
(v, v") is annotated by the minimal j' > j such that j' £ J. Then, by induction there exists 
a run that ends in s' and visits R\(j'+i)..k]nJ- I n addition s is in Rj< . The claim follows. 

As during the transformation from c/j'_i to all the states s £ li'(v) are found in sons 
whose edge is annotated by we conclude that every state s £ k>(v) is reachable along a 
run that visits R T . □ 

We find a witness set J C [k] and construct an infinite tree with finite branching degree. 
The root of the tree corresponds to the initial state of S. Every node in the tree is labeled 
by some state of S and a time stamp i. An edge between the nodes labeled (s, i) and (t, i') 
corresponds to a run starting in s, ending in t, reading w[i, i! — 1], avoiding GL, and visiting 
Rj. From Konig's lemma this tree contains an infinite branch. The composition of all the 
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run segments in this infinite branch is an infinite accepting run of S on w according to 
witness set J. 

Let (so,0) label the root of T. Let i$ be the minimal location such that for all a' < 2a 
the set F a i is not visited after i$. Let v be the node such that for all i > iq we have 
Mi(v) = a + 1. Let J C [k] be such that for all i > iq we have hi(v) = J. Let i\ be the 
minimal location such that i\ > iq and = a + 1 (that is step 7 was applied to v). For 
every state s in /^(u) we add a node to T, label it by (s,i\) and connect it to the root. 
We extend the tree by induction. We have a tree with leaves labeled by the states in li o (v) 
stamped by time i Q , and fi a = a + 1 (step 7 was applied to v). That is, for every state s in 
k (v) there exists a leaf labeled (s,i Q ). We know that F^ a is visited infinitely often. Hence, 
there exists a minimal i 0+ \ > i such that fi Q+1 = a + 1 (step 7 is applied to v). For every 
state s' in Zi 0+1 (v) we add a node to the tree and label it (s',i 0+ i). From Claim |4"U1 there 
exists a state s in ^ D (f) such that s' is reachable from s reading w[i ,i 0+ i — 1] with a run 
that avoids G- and visits i? 7 . We connect (s',i 0+ i) to (s,i ). 

From Claim l48l it follows that every edge (sq, 0), (s', «i) corresponds to some run starting 
in so, ending in s', and reading w[0,ii — 1]. From Claim I4l9l every other edge in the tree 
(s,i ), (s',i 0+ i) corresponds to some run starting in s, ending in s', reading w[i ,i 0+ i — 1], 
avoiding G_, and visiting Rj. From Konig's lemma there exists an infinite branch in the 
tree. This infinite branch corresponds to an accepting run of 5 on w. O 

Theorem 4.10. For every NSW S with n states and index k there exists a DPW T> with 
2n n (k+l) n{ - k+1 \n{k+l))\ states and index 2n(k + 1) such that L(V) = L(S). □ 

As before, when compared to Safra's construction, we reduce the number of states and get 
a parity automaton. The advantages are similar to those described in Section [3l 

5. Conclusions and Future Work 

We improved both of Safra's determinization constructions. In both cases, we reduce 
the number of states and more important construct directly a parity automaton. In the case 
of NBW we reduce the maximal number of states from (12) n n 2n to 2n n n\. In the case of 
NSW we reduce the maximal number of states from (12) n ( fc+1 )n n (/c+l) n ( fe+1 )(n(fc+l)) n ( fe+1 ) 
to 2ri a {k+\) n ( k+1 \n(k+\))\. The fact that our automata are parity automata makes them 
easier to use 'down the line'. The algorithms for solving parity games are much simpler 
than those that solve Rabin games. In particular, Rabin games are NP-complete in the 
Rabin index while parity games are known to be in NPfico-NP. The complement of a DPW 
is again a DPW. In contrast, the complement of a DRW is a DSW. In order to get back 
to Rabin (or parity) one has to multiply the number of states by k 2 k\, where k is the 
number of Rabin pairs of the automaton. Our upper bound improves the best known upper 
bound in numerous applications, such as solving games, complementation of tree automata, 
emptiness of alternating tree automata, satisfiability of /n-calculus with backward modalities 
and CTL*. In particular, in the recent emptiness algorithm for alternating parity tree 
automata |KV05| the upper bound is reduced from (12) n n 4n +2n (n!) n to (2n n n!) 2n . 

There are lower bounds for both determinization constructions. For an NBW with n 
states the best possible DPW has at least n! states [Mic88|. For an NSW with n states 
and k Streett pairs the best possible DPW has at least (Q(nk)) n states [Yan06| . We have 
gotten closer to this lower bound however there is still a large gap between the lower bound 



DETERMINIZATION OF BUCHI AND STREETT AUTOMATA 



19 



and the upper bound. We are not aware on similar lower bounds on the index of the 
resulting automata. As DPW[k+l] recognize more languages than DPW[k] |Wag79| and 
NBW recognize all w-regular languages we cannot hope for a determinization construction 
with constant index. The language = {w £ [L.k]^ \ min{inf{w)) is even} is in DPW[k] 
but not in DPW[k-l]. It is simple to construct an NBW with k states recognizing L^. This 
suggests that a determinization of NBW with k states may result in DPW with k priorities. 
It is an interesting question whether the 2k priorities produced by our construction are 
indeed necessary. A similar question arises for NSW. 
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